Automatic home video abstraction using audio contents
نویسندگان
چکیده
With the increasing number of people who can afford to make videos to record their lives, home videos play more and more important role in people’s lives. Video abstraction is an efficient way to help review such a huge amount of home videos. In this paper, an automatic home video abstraction method mainly using audio contents is presented. The audio contents are first segmented and classified as speech, music, silence and special sounds basing on audio short-time features and morphology. Then special sounds are further categorized as songs, laughter, applause, scream and others using Hidden Markov Model (HMM). After that, motion level and blur degree are acquired using the video contents. Finally, video segments containing special effects, such as speech, laughter, song, applause, scream, and specified motion level and blur degree, are extracted as the main parts of the abstract. The remaining parts of the abstract are generated using key frame information. The experimental results show that the proposed algorithm can extract desired parts of home video to generate satisfactory video abstracts.
منابع مشابه
Audio and video combined for home video abstraction
With the increasing number of people who can afford to make videos to record their lives, home videos play more and more important role in multimedia. Video abstraction is an efficient way to help review such a huge amount of home videos. In this paper, a home video abstraction technique combining audio and video features is presented. The audio contents are firstly classified as silence, pure ...
متن کاملVideo Abstraction in H.264/AVC Compressed Domain
Video abstraction allows searching, browsing and evaluating videos only by accessing the useful contents. Most of the studies are using pixel domain, which requires the decoding process and needs more time and process consuming than compressed domain video abstraction. In this paper, we present a new video abstraction method in H.264/AVC compressed domain, AVAIF. The method is based on the norm...
متن کاملAutomatic Music Transcription using Audio-Visual Fusion for Violin Practice in Home Environment
Violin practice in a home environment, where there is often no teacher available, can benefit from automatic music transcription to provide feedback to the student. This paper describes a high performance violin transcription system with three main contributions. First, as onset detection is an important but challenging task for automatic transcription of pitched non-percussive music, such as f...
متن کاملDesign of the VoD System for High-Quality Video and Audio with D1 over IP
Internet broadcasting and streaming contents have recently been attracting a great deal of attention, despite their inadequate content quality. The demand for such services is projected to continue to increase in the near future, and streaming contents are expected to play a major role among applications for the next-generation Internet [1] [2]. With the further development of DVTS, it will cer...
متن کاملAudiovisual Sensing of Human Movements for Home-care and Security in a Smart Environment
This paper presents the necessity and possibility of smart sensing using a multitude of sensors such as audio and visual sensors for human movement detection for home-care and home security applications in a smart environment. We define an event and spatial relationship based approach to the problem. Use of multisensory information to even detection is proposed and its prototype implementation ...
متن کامل